Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 3883 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 667.5 KiB |
| Average record size in memory | 176.0 B |
Variable types
| Numeric | 3 |
|---|---|
| Categorical | 19 |
Title has a high cardinality: 3883 distinct values | High cardinality |
df_index is highly correlated with MovieID | High correlation |
MovieID is highly correlated with df_index | High correlation |
Animation is highly correlated with Children's | High correlation |
Children's is highly correlated with Animation | High correlation |
df_index is uniformly distributed | Uniform |
MovieID is uniformly distributed | Uniform |
Title is uniformly distributed | Uniform |
df_index has unique values | Unique |
MovieID has unique values | Unique |
Title has unique values | Unique |
Reproduction
| Analysis started | 2022-07-14 03:03:31.354740 |
|---|---|
| Analysis finished | 2022-07-14 03:05:12.820504 |
| Duration | 1 minute and 41.47 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 3883 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1941 |
| Minimum | 0 |
|---|---|
| Maximum | 3882 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 194.1 |
| Q1 | 970.5 |
| median | 1941 |
| Q3 | 2911.5 |
| 95-th percentile | 3687.9 |
| Maximum | 3882 |
| Range | 3882 |
| Interquartile range (IQR) | 1941 |
Descriptive statistics
| Standard deviation | 1121.069876 |
|---|---|
| Coefficient of variation (CV) | 0.5775733518 |
| Kurtosis | -1.2 |
| Mean | 1941 |
| Median Absolute Deviation (MAD) | 971 |
| Skewness | 0 |
| Sum | 7536903 |
| Variance | 1256797.667 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3882 | 1 | < 0.1% |
| 3647 | 1 | < 0.1% |
| 2686 | 1 | < 0.1% |
| 2080 | 1 | < 0.1% |
| 3319 | 1 | < 0.1% |
| 2064 | 1 | < 0.1% |
| 1373 | 1 | < 0.1% |
| 3658 | 1 | < 0.1% |
| 1374 | 1 | < 0.1% |
| 3322 | 1 | < 0.1% |
| Other values (3873) | 3873 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 3882 | 1 | |
| 3881 | 1 | |
| 3880 | 1 | |
| 3879 | 1 | |
| 3878 | 1 | |
| 3877 | 1 | |
| 3876 | 1 | |
| 3875 | 1 | |
| 3874 | 1 | |
| 3873 | 1 |
| Distinct | 3883 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1986.049446 |
| Minimum | 1 |
|---|---|
| Maximum | 3952 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 196.1 |
| Q1 | 982.5 |
| median | 2010 |
| Q3 | 2980.5 |
| 95-th percentile | 3756.9 |
| Maximum | 3952 |
| Range | 3951 |
| Interquartile range (IQR) | 1998 |
Descriptive statistics
| Standard deviation | 1146.778349 |
|---|---|
| Coefficient of variation (CV) | 0.5774168169 |
| Kurtosis | -1.214552412 |
| Mean | 1986.049446 |
| Median Absolute Deviation (MAD) | 999 |
| Skewness | -0.01908154472 |
| Sum | 7711830 |
| Variance | 1315100.583 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3952 | 1 | < 0.1% |
| 3716 | 1 | < 0.1% |
| 2755 | 1 | < 0.1% |
| 2149 | 1 | < 0.1% |
| 3388 | 1 | < 0.1% |
| 2133 | 1 | < 0.1% |
| 1394 | 1 | < 0.1% |
| 3727 | 1 | < 0.1% |
| 1395 | 1 | < 0.1% |
| 3391 | 1 | < 0.1% |
| Other values (3873) | 3873 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 3952 | 1 | |
| 3951 | 1 | |
| 3950 | 1 | |
| 3949 | 1 | |
| 3948 | 1 | |
| 3947 | 1 | |
| 3946 | 1 | |
| 3945 | 1 | |
| 3944 | 1 | |
| 3943 | 1 |
| Distinct | 3883 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| Contender, The (2000) | 1 |
|---|---|
| Fatal Beauty (1987) | 1 |
| Light of Day (1987) | 1 |
| House II: The Second Story (1987) | 1 |
| Harry and the Hendersons (1987) | 1 |
| Other values (3878) |
Length
| Max length | 82 |
|---|---|
| Median length | 68 |
| Mean length | 24.20267834 |
| Min length | 8 |
Characters and Unicode
| Total characters | 93979 |
|---|---|
| Distinct characters | 98 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3883 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Contender, The (2000) |
|---|---|
| 2nd row | Whipped (2000) |
| 3rd row | Big Momma's House (2000) |
| 4th row | Isn't She Great? (2000) |
| 5th row | Shanghai Noon (2000) |
Common Values
| Value | Count | Frequency (%) |
| Contender, The (2000) | 1 | < 0.1% |
| Fatal Beauty (1987) | 1 | < 0.1% |
| Light of Day (1987) | 1 | < 0.1% |
| House II: The Second Story (1987) | 1 | < 0.1% |
| Harry and the Hendersons (1987) | 1 | < 0.1% |
| Adventures in Babysitting (1987) | 1 | < 0.1% |
| Raising Arizona (1987) | 1 | < 0.1% |
| Near Dark (1987) | 1 | < 0.1% |
| Tin Men (1987) | 1 | < 0.1% |
| Who's That Girl? (1987) | 1 | < 0.1% |
| Other values (3873) | 3873 |
Length
| Value | Count | Frequency (%) |
| the | 1251 | 8.0% |
| of | 364 | 2.3% |
| 1996 | 345 | 2.2% |
| 1995 | 342 | 2.2% |
| 1998 | 337 | 2.1% |
| 1997 | 315 | 2.0% |
| 1999 | 283 | 1.8% |
| 1994 | 257 | 1.6% |
| a | 170 | 1.1% |
| 1993 | 165 | 1.1% |
| Other values (4646) | 11846 |
Most occurring characters
| Value | Count | Frequency (%) |
| 11792 | 12.5% | |
| e | 6698 | 7.1% |
| 9 | 6463 | 6.9% |
| a | 4224 | 4.5% |
| ) | 4153 | 4.4% |
| ( | 4153 | 4.4% |
| 1 | 3949 | 4.2% |
| o | 3882 | 4.1% |
| n | 3537 | 3.8% |
| r | 3421 | 3.6% |
| Other values (88) | 41707 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46031 | |
| Decimal Number | 15821 | 16.8% |
| Space Separator | 11792 | 12.5% |
| Uppercase Letter | 10193 | 10.8% |
| Close Punctuation | 4153 | 4.4% |
| Open Punctuation | 4153 | 4.4% |
| Other Punctuation | 1766 | 1.9% |
| Dash Punctuation | 68 | 0.1% |
| Currency Symbol | 1 | < 0.1% |
| Other Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6698 | |
| a | 4224 | |
| o | 3882 | 8.4% |
| n | 3537 | 7.7% |
| r | 3421 | 7.4% |
| i | 3406 | 7.4% |
| t | 3176 | 6.9% |
| s | 2510 | 5.5% |
| h | 2392 | 5.2% |
| l | 2269 | 4.9% |
| Other values (32) | 10516 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1391 | |
| S | 868 | 8.5% |
| M | 730 | 7.2% |
| B | 702 | 6.9% |
| C | 616 | 6.0% |
| A | 587 | 5.8% |
| D | 522 | 5.1% |
| L | 497 | 4.9% |
| P | 470 | 4.6% |
| F | 466 | 4.6% |
| Other values (19) | 3344 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1025 | |
| ' | 224 | 12.7% |
| : | 198 | 11.2% |
| . | 193 | 10.9% |
| ! | 44 | 2.5% |
| & | 41 | 2.3% |
| ? | 17 | 1.0% |
| / | 16 | 0.9% |
| * | 6 | 0.3% |
| ; | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 6463 | |
| 1 | 3949 | |
| 8 | 1113 | 7.0% |
| 7 | 743 | 4.7% |
| 6 | 727 | 4.6% |
| 0 | 718 | 4.5% |
| 5 | 670 | 4.2% |
| 4 | 540 | 3.4% |
| 2 | 490 | 3.1% |
| 3 | 408 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 11792 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4153 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4153 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 68 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ³ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56224 | |
| Common | 37755 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6698 | 11.9% |
| a | 4224 | 7.5% |
| o | 3882 | 6.9% |
| n | 3537 | 6.3% |
| r | 3421 | 6.1% |
| i | 3406 | 6.1% |
| t | 3176 | 5.6% |
| s | 2510 | 4.5% |
| h | 2392 | 4.3% |
| l | 2269 | 4.0% |
| Other values (61) | 20709 |
Common
| Value | Count | Frequency (%) |
| 11792 | ||
| 9 | 6463 | |
| ) | 4153 | 11.0% |
| ( | 4153 | 11.0% |
| 1 | 3949 | 10.5% |
| 8 | 1113 | 2.9% |
| , | 1025 | 2.7% |
| 7 | 743 | 2.0% |
| 6 | 727 | 1.9% |
| 0 | 718 | 1.9% |
| Other values (17) | 2919 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 93917 | |
| None | 62 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 11792 | 12.6% | |
| e | 6698 | 7.1% |
| 9 | 6463 | 6.9% |
| a | 4224 | 4.5% |
| ) | 4153 | 4.4% |
| ( | 4153 | 4.4% |
| 1 | 3949 | 4.2% |
| o | 3882 | 4.1% |
| n | 3537 | 3.8% |
| r | 3421 | 3.6% |
| Other values (68) | 41645 |
None
| Value | Count | Frequency (%) |
| é | 25 | |
| è | 6 | 9.7% |
| ö | 4 | 6.5% |
| à | 4 | 6.5% |
| í | 3 | 4.8% |
| ø | 2 | 3.2% |
| É | 2 | 3.2% |
| î | 2 | 3.2% |
| á | 2 | 3.2% |
| ó | 2 | 3.2% |
| Other values (10) | 10 | 16.1% |
year
Real number (ℝ≥0)
| Distinct | 81 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1986.066959 |
| Minimum | 1919 |
|---|---|
| Maximum | 2000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.5 KiB |
Quantile statistics
| Minimum | 1919 |
|---|---|
| 5-th percentile | 1946 |
| Q1 | 1982 |
| median | 1994 |
| Q3 | 1997 |
| 95-th percentile | 1999 |
| Maximum | 2000 |
| Range | 81 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 16.89569016 |
|---|---|
| Coefficient of variation (CV) | 0.008507110037 |
| Kurtosis | 2.402797142 |
| Mean | 1986.066959 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -1.766093575 |
| Sum | 7711898 |
| Variance | 285.4643459 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 1996 | 345 | 8.9% |
| 1995 | 342 | 8.8% |
| 1998 | 337 | 8.7% |
| 1997 | 315 | 8.1% |
| 1999 | 283 | 7.3% |
| 1994 | 257 | 6.6% |
| 1993 | 165 | 4.2% |
| 2000 | 156 | 4.0% |
| 1986 | 104 | 2.7% |
| 1992 | 102 | 2.6% |
| Other values (71) | 1477 |
| Value | Count | Frequency (%) |
| 1919 | 3 | 0.1% |
| 1920 | 2 | 0.1% |
| 1921 | 1 | < 0.1% |
| 1922 | 2 | 0.1% |
| 1923 | 3 | 0.1% |
| 1925 | 6 | |
| 1926 | 8 | |
| 1927 | 6 | |
| 1928 | 3 | 0.1% |
| 1929 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 2000 | 156 | |
| 1999 | 283 | |
| 1998 | 337 | |
| 1997 | 315 | |
| 1996 | 345 | |
| 1995 | 342 | |
| 1994 | 257 | |
| 1993 | 165 | |
| 1992 | 102 | 2.6% |
| 1991 | 60 | 1.5% |
Action
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3380 | |
| 1 | 503 | 13.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3380 | |
| 1 | 503 | 13.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3380 | |
| 1 | 503 | 13.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3380 | |
| 1 | 503 | 13.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3380 | |
| 1 | 503 | 13.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3380 | |
| 1 | 503 | 13.0% |
Adventure
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 283 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3600 | |
| 1 | 283 | 7.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3600 | |
| 1 | 283 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3600 | |
| 1 | 283 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3600 | |
| 1 | 283 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3600 | |
| 1 | 283 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3600 | |
| 1 | 283 | 7.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 105 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3778 | |
| 1 | 105 | 2.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3778 | |
| 1 | 105 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3778 | |
| 1 | 105 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3778 | |
| 1 | 105 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3778 | |
| 1 | 105 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3778 | |
| 1 | 105 | 2.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 251 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3632 | |
| 1 | 251 | 6.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3632 | |
| 1 | 251 | 6.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3632 | |
| 1 | 251 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3632 | |
| 1 | 251 | 6.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3632 | |
| 1 | 251 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3632 | |
| 1 | 251 | 6.5% |
Comedy
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2683 | |
| 1 | 1200 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 2683 | |
| 1 | 1200 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2683 | |
| 1 | 1200 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2683 | |
| 1 | 1200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2683 | |
| 1 | 1200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2683 | |
| 1 | 1200 |
Crime
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 211 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3672 | |
| 1 | 211 | 5.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3672 | |
| 1 | 211 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3672 | |
| 1 | 211 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3672 | |
| 1 | 211 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3672 | |
| 1 | 211 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3672 | |
| 1 | 211 | 5.4% |
Documentary
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 127 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3756 | |
| 1 | 127 | 3.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3756 | |
| 1 | 127 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3756 | |
| 1 | 127 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3756 | |
| 1 | 127 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3756 | |
| 1 | 127 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3756 | |
| 1 | 127 | 3.3% |
Drama
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2280 | |
| 1 | 1603 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 2280 | |
| 1 | 1603 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2280 | |
| 1 | 1603 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2280 | |
| 1 | 1603 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2280 | |
| 1 | 1603 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2280 | |
| 1 | 1603 |
Fantasy
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 68 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Film-Noir
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 44 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3839 | |
| 1 | 44 | 1.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3839 | |
| 1 | 44 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3839 | |
| 1 | 44 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3839 | |
| 1 | 44 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3839 | |
| 1 | 44 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3839 | |
| 1 | 44 | 1.1% |
Horror
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 343 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3540 | |
| 1 | 343 | 8.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3540 | |
| 1 | 343 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3540 | |
| 1 | 343 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3540 | |
| 1 | 343 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3540 | |
| 1 | 343 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3540 | |
| 1 | 343 | 8.8% |
Musical
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 114 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3769 | |
| 1 | 114 | 2.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3769 | |
| 1 | 114 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3769 | |
| 1 | 114 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3769 | |
| 1 | 114 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3769 | |
| 1 | 114 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3769 | |
| 1 | 114 | 2.9% |
Mystery
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 106 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3777 | |
| 1 | 106 | 2.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3777 | |
| 1 | 106 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3777 | |
| 1 | 106 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3777 | |
| 1 | 106 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3777 | |
| 1 | 106 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3777 | |
| 1 | 106 | 2.7% |
Romance
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3412 | |
| 1 | 471 | 12.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3412 | |
| 1 | 471 | 12.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3412 | |
| 1 | 471 | 12.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3412 | |
| 1 | 471 | 12.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3412 | |
| 1 | 471 | 12.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3412 | |
| 1 | 471 | 12.1% |
Sci-Fi
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 276 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3607 | |
| 1 | 276 | 7.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3607 | |
| 1 | 276 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3607 | |
| 1 | 276 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3607 | |
| 1 | 276 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3607 | |
| 1 | 276 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3607 | |
| 1 | 276 | 7.1% |
Thriller
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3391 | |
| 1 | 492 | 12.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3391 | |
| 1 | 492 | 12.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3391 | |
| 1 | 492 | 12.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3391 | |
| 1 | 492 | 12.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3391 | |
| 1 | 492 | 12.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3391 | |
| 1 | 492 | 12.7% |
War
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 143 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3740 | |
| 1 | 143 | 3.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3740 | |
| 1 | 143 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3740 | |
| 1 | 143 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3740 | |
| 1 | 143 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3740 | |
| 1 | 143 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3740 | |
| 1 | 143 | 3.7% |
Western
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.5 KiB |
| 0 | |
|---|---|
| 1 | 68 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3883 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3883 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3883 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3815 | |
| 1 | 68 | 1.8% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | MovieID | Title | year | Action | Adventure | Animation | Children's | Comedy | Crime | Documentary | Drama | Fantasy | Film-Noir | Horror | Musical | Mystery | Romance | Sci-Fi | Thriller | War | Western | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3882 | 3952 | Contender, The (2000) | 2000 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| 1 | 3528 | 3597 | Whipped (2000) | 2000 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 3577 | 3646 | Big Momma's House (2000) | 2000 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 3170 | 3239 | Isn't She Great? (2000) | 2000 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 | 3555 | 3624 | Shanghai Noon (2000) | 2000 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 3554 | 3623 | Mission: Impossible 2 (2000) | 2000 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| 6 | 3549 | 3618 | Small Time Crooks (2000) | 2000 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 3548 | 3617 | Road Trip (2000) | 2000 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 3547 | 3616 | Loser (2000) | 2000 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| 9 | 3546 | 3615 | Dinosaur (2000) | 2000 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Last rows
| df_index | MovieID | Title | year | Action | Adventure | Animation | Children's | Comedy | Crime | Documentary | Drama | Fantasy | Film-Noir | Horror | Musical | Mystery | Romance | Sci-Fi | Thriller | War | Western | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3873 | 3572 | 3641 | Woman of Paris, A (1923) | 1923 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3874 | 2161 | 2230 | Always Tell Your Wife (1923) | 1923 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3875 | 3126 | 3195 | Tess of the Storm Country (1922) | 1922 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3876 | 1327 | 1348 | Nosferatu (Nosferatu, eine Symphonie des Grauens) (1922) | 1922 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3877 | 3241 | 3310 | Kid, The (1921) | 1921 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3878 | 3240 | 3309 | Dog's Life, A (1920) | 1920 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3879 | 3162 | 3231 | Saphead, The (1920) | 1920 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3880 | 2754 | 2823 | Spiders, The (Die Spinnen, 1. Teil: Der Goldene See) (1919) | 1919 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3881 | 3063 | 3132 | Daddy Long Legs (1919) | 1919 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3882 | 2752 | 2821 | Male and Female (1919) | 1919 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |